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PRIORITY CLAIM 



This application is a continuing application of and claims priority to U.S. Patent 
Application No. 10/389,290 entitled "Systems and Methods For Assessing Documents 
5 Using Analysis of Machine-Printed Writing and Pre-Printed hiformation" filed by Houle, 
et al. on March 14, 2003, which claims priority to U.S. Provisional Application No. 
60/364,675 entitled "Systems and Methods for Handwriting Analysis in Documents," 
filedMarchl5,2002. 

10 BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to analyzing information in documents 
15 such as payment instruments. Certain embodiments relate to computer-implemented 
systems and methods for analyzing and assessing documents. 

2. Description of the Related Art 

20 Fraud related to forgery of documents, such as checks, has increased steadily 

worldwide over the past few years. For example, in Europe fraud has doubled in the past 
two years. This is a very difficult problem mainly because of the wide range of 
techniques used to reroute money from an account to a fraudulent account. Fraud may be 
found in any document-based business where money transfers take place. There has been 

25 a significant amount of effort applied in developing technology, such as signature 
verification, for assessing forgeries in financial documents. 

Many financial institutions, such as banks, are required to keep copies of 
processed financial documents for a long period of time, for example, months, and even 
30 years. Such institutions commonly employ image-based financial document systems that 
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store images of processed documents in the form of images on a database on a computer 
system. 

Databases including images depicting handwriting known to be authentic are an 
5 important resource for methods and systems of assessing forgery. A handwriting sample 
of unknown validity, such as a signature, may be compared to images in such a database 
to determine if the handwriting sample is a forgery. However, such a process may be 
difficult and expensive if the database includes a very large amount of image data. In 
addition, many methods and systems for assessing forgery in financial documents focus 
10 on assessing forgery in a limited portion of the document, for example, of a signature. 
Such methods and systems may lead to a large number of financial documents being 
incorrectly labeled as containing forgeries, as well as failing to identify forged contents in 
non-signature portions of a document. 

15 U.S. Patent No. 6,157,73 1 Hu et al. discloses a signature verification method 

and is incorporated by reference as if fully set forth herein. The method involves 
segmenting a smoothed and normalized signature and, for each segment, evaluating at 
least one local feature to obtain a feature value vector. 

20 A method and system of recognizing handwritten words in scanned documents is 

disclosed in U.S. Patent No. 6,108,444 to Syeda-Mahmood and is incorporated by 
reference as if fully set forth herein. A method of detecting and recognizing handwritten 
words is described. The applications described in the patent are directed to the use of 
handwriting recognition algorithms as part of keyword searches. 

25 

U.S. Patent No. 6,084,985 to Dolfmg et al. discloses a method for on-line 
handwriting recognition and is incorporated by reference as if fully set forth herein. The 
method employs feature vectors based on aggregated observations. 
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U.S. Patent No. 5,995,953 to Rindtorff et al. discloses a method of comparing 
handwriting and signatures and is incorporated by reference as if fully set forth herein. 
The method relies on comparison of features of a signature rather than the images of 
signatures. 

5 

U.S. Patent No. 5,909,500 to Moore discloses a method of signature verification 
and is incorporated by reference as if fully set forth herein. The method is based on 
analysis of the environs attendant to the signature string. 

10 U.S. Patent No. 5,710,916 to Barbara et al. discloses a method and apparatus for 

similarity matching of handwritten data objects and is incorporated by reference as if fiiUy 
set forth herein. 

A method of signature verification is disclosed in U.S. Patent No. 5,828,772 to 
15 Kashi et al. The method compares the numerical values of parameters evaluated on a trial 
signature with stored reference data derived from previously entered reference signatures. 

U.S. Patent No. 5,680,470 to Moussa et al. discloses a method of automated 
signature verification and is incorporated by reference as if fully set forth herein. In the 
20 method, a test signature, for example, a signature entered by an operator, may be 

preprocessed and examined for test features. The test features may be compared against 
features of a set of template signatures, and verified in response to the presence or 
absence of the test features in the template signatures. 

25 U.S. Patent No. 5,454,046 to Carman discloses a universal handwriting 

recognition system and is incorporated by reference as if fully set forth herein. The 
system converts user-entered time ordered stroke sequences into computer readable text. 
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SUMMARY OF THE INVENTION 



An embodiment of the present invention relates to a computer-implemented 
method for analyzing and assessing fraud in documents. Analysis and assessment of 
5 documents may use a profile created for authorized writers of a document. 

In one embodiment, a method of generating a writing profile on a computer 
system may include providing one or more documents to the computer system. In some 
embodiments, at least one of the documents may include at least one information field. In 

10 other embodiments, at least one of the documents may include at least two information 
fields. The method may fiirther include determining at least one writing profile 
representation for at least two of the information fields using writing from at least one of 
the information fields. Altematively, the method may include determining at least one 
writing profile representation for at least one of the information fields using writing from 

15 at least two of the information fields. In other embodiments, at least two writing profile 
representations for at least one of the information fields may be assessed using writing 
from at least one of the information fields. 

In an embodiment, a method of generating a writing profile on a computer system 
20 may further include providing one or more additional documents to the computer system. 
At least one of the additional documents may include at least one information field. In 
another embodiment, at least one of the additional documents may include at least two 
information fields. The method may fiirther include updating at least one of the writing 
profile representations using at least one of the information fields of at least one of the 
25 additional documents. 

In an embodiment, a method of assessing a document using a computer system 
may include providing a document to the computer system. In some embodiments, the 
document may include at least one inforaiation field. Altematively, the document may 
30 also include at least two information fields. The method may fiirther include comparing 
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writing in at least two of the information fields of the document to at least one writing 
profile representation. At least one writing profile representation may be firom at least 
one information field of at least one other document. Altematively, the method may 
include comparing writing in at least one of the information fields of the document to at 
5 least one writing profile representation. At least one writing profile representation may 
be firom at least two information fields of at least one other document. In other 
embodiments, writing in at least one of the information fields of the document may be 
compared to at least two writing profile representations. At least two writing profile 
representations may be fi-om at least one information field of at least one other document. 

10 

In one embodiment, a method of assessing information in a document using a 
computer system may include obtaining information on writing in an information field of 
a dociunent. The document may include at least two information fields. The method may 
fiirther include comparing the obtained written information in the information field and 

15 written information in at least one other information field to at least one writing profile 
representation. In another embodiment, the method may include comparing the obtained 
written information in the information field and written information in at least two other 
information fields to at least one writing profile representation. Altematively, the 
obtained written information in the information field and written information in at least 

20 one other information field may be compared to at least two writing profile 
representations from at least one other document. 

In some embodiments, at least one of the writing profile representations may 
include written information from the information field and written information from at 

25 least one of the other information fields. In other embodiments, at least one of the writing 
profile representations may include written information from the information field and 
written information from at least two of the other information fields from at least the one 
of the other documents. Altematively, at least two of the writing profile representations 
may include written information from the information field and written information from 

30 at least one of the other information fields from at least the one other document. 
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In an embodiment, a method of identifying a document with forged inforaiation 
using a computer system may include providing a document to the computer system. The 
document may include at least one information field. Alternatively, the document may 
5 include at least two information fields. The method may further include comparing 
writing in at least two of the information fields of the document to at least one forger 
writing profile representation. At least one forger writing profile representation may be 
fi:om at least one information field of at least one document that includes forged 
information, hi another embodiment, the method may include comparing writing in at 

10 least one of the information fields of the document to at least one forger writing profile 
representation. At least one forger writing profile may be from at least two information 
fields of at least one docimient that includes forger information. Alternatively, writing in 
at least one of the information fields of the document may be compared to at least two 
forger writing profile representations. At least one forger writing profile representation 

15 may be from at least one information field of at least one document that includes forger 
information. 

The method may additionally include identifying the document as a document that 
includes forged information. The identification may be made firom an approximate match 
20 of at least one forger writing profile representation with writing in the document. 

Li certain embodiments, a method of capturing written information from an 
information field of a document using a computer system may include providing a 
document to the computer system. The document may include at least one information 

25 field. The method may further include assessing whether writing in an information field 
approximately matches a writing profile representation. The writing profile 
representation may be from at least one information field from at least one other 
document. In an embodiment, at least one matching writing profile representation is 
associated with a corresponding text representation in a computer processable format in 

30 memory on the computer system. Additionally, the method may include associating the 
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information field with the text representation corresponding to the matching writing 
profile representation. 

In an embodiment, a method of assessing a document using a computer system 
may include providing a document to the computer system. In some embodiments, the 
document may include at least one information field. Altematively, the document may 
include at least two information fields. The method may fiirther include comparing pre- 
printed information in at least two of the information fields of the document to at least 
one pre-printed profile representation. At least one pre-printed profile representation may 
be fi*om at least one information field of at least one other document. Altematively, the 
method may include comparing pre-printed text in at least one of the information fields of 
the document to at least one pre-printed profile representation. At least one pre-printed 
profile representation may be fi-om at least one information field of at least one other 
document. In other embodiments, pre-printed information in at least one of the 
information fields of the document may be compared to at least two pre-printed profile 
representations. At least two pre-printed profile representations may be firom at least one 
information field of at least one other document. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



A better understanding of the present invention may be obtained when the 
following detailed description of preferred embodiments is considered in conjimction 
5 with the following drawings, in which: 

Fig. 1 depicts an embodiment of a network diagram of a wide area network 
suitable for implementing various embodiments; 

10 Fig. 2 depicts an embodiment of a computer system suitable for implementing 

various embodiments; 

FIG. 3 illustrates an embodiment of a system and method for analyzing 
documents; 

15 

FIG. 4 depicts an illustration of a check; 

FIG. 5 depicts an illustration of a giro; 

20 FIG. 6 depicts a flow chart of a method for assessing fraud in documents; 

FIG. 7 illustrates writing features included in mathematical representations of 
writing; 

25 FIG. 8 illustrates writing features included in mathematical representations of 

writing; 

FIG. 9 illustrates writing features included in mathematical representations of 
writing; 

30 
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FIG. 10 illustrates legal amount entries in a legal amount field; 

FIG. 1 1 depicts a flow chart of a method of generating a writing profile; 

5 FIG. 12 illustrates determining a handwriting profile fi-om handwriting samples; 

FIG. 13 illustrates dynamic variation of handwriting; 

FIG. 14 depicts a flow chart of a method of generating a writing profile firom 
10 images in a computer database; 

Fig. 15 depicts a flow chart of a method of generating a writing profile from 
images presented for processing; 

15 FIG. 16 depicts a flow chart of a method for assessing a document; 

FIG. 17 depicts a flow chart of a method for assessing a document; 

FIG. 18 depicts a flow chart of a method for assessing a document; 

20 

FIGs. 19 and 20 illustrate assessing fi-aud in the signature field of a giro; 
FIG. 21 is an illustration of assessing fi-aud in a check; 
25 FIG. 22 is an illustration of assessing fi'aud in a giro; 

FIG. 23 is an illustration of assessing fi-aud in the city field of a giro; 
FIG. 24 depicts a flow chart of a method for assessing a document; 

30 
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FIG. 25 illustrates converting a character in a handwriting image to a 
mathematical representation; 

FIG. 26 illustrates assessing fraud in a numeric field of a payment instrument; 

FIG. 27 depicts a flow chart of a method for assessing a document; 

FIG. 28 is an illustration of assessing fraud in a check; 

FIG. 29 depicts a flow chart of a method for assessing a document; 

FIG. 30 illustrates assessing fraud in a date field of a payment instrument; 

FIG. 3 1 depicts a flow chart of a method for assessing a document; 

FIG. 32 illustrates assessing fraud in a city field of a giro; 

FIG. 33 is an illustration of assessing fraud in a check; 

FIG. 34 depicts a flow chart of a method for assessing a document; 

FIG. 35 depicts a flow chart of a method for assessing a document; 

FIG. 36 depicts stock characteristics of a check; 

FIG. 37 depicts a flow chart of a method for assessing a document; 

FIG. 38 illustrates assessing fraud in a giro; 

FIG. 39 depicts a flow chart of a method for assessing a document; 
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FIGs. 40a-d illustrate assessing fraud in a giro; 

FIG. 41 depicts a flow chart of a method for assessing a document; 

FIG. 42 depicts a flow chart of a method for capturing written information from a 
document; and 

FIG. 43 illustrates capturing written information from a document. 

While the invention is susceptible to various modifications and altemative forms, 
specific embodiments thereof are shown by way of example in the drawings and will 
herein be described in detail. It should be understood, however, that the drawings and 
detailed description thereto are not intended to limit the invention to the particular form 
disclosed, but on the contrary, the intention is to cover all modifications, equivalents and 
alternatives falling within the spirit and scope of the present invention as defined by the 
appended claims. 
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DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS 



Fig. 1 illustrates a wide area network ("WAN") according to one embodiment. 
WAN 102 may be a network that spans a relatively large geographical area. The Internet is 
5 an example of WAN 102. WAN 102 typically includes a plurality of computer systems that 
may be interconnected through one or more networks. Although one particular 
configuration is shown in Fig. 1, WAN 102 may include a variety of heterogeneous 
computer systems and networks that may be interconnected in a variety of ways and that 
may run a variety of software appUcations. 

10 

One or more local area networks ("LANs") 104 may be coupled to WAN 102. 
LAN 104 may be a network that spans a relatively small area. Typically, LAN 104 may 
be confined to a single building or group of buildings. Each node (i.e., individual 
computer system or device) on LAN 104 may have its own CPU with which it may 

15 execute programs, and each node may also be able to access data and devices anywhere 
on LAN 104. LAN 104, thus, may allow many users to share devices (e.g., printers) and 
data stored on file servers. LAN 104 may be characterized by a variety of types of 
topology (i.e., the geometric arrangement of devices on the network), of protocols (i.e., 
the rules and encoding specifications for sending data, and whether the network uses a 

20 peer-to-peer or client/server architecture), and of media (e.g., twisted-pair wire, coaxial 
cables, fiber optic cables, and/or radio waves). 

Each LAN 104 may include a plurahty of interconnected computer systems and 
optionally one or more other devices such as one or more workstations 1 10a, one or more 
25 personal computers 1 12a, one or more laptop or notebook computer systems 1 14, one or 
more server computer systems 116, and one or more network printers 118. As illustrated 
in Fig. 1, an example LAN 104 may include one of each computer systems 11 Oa, 1 12a, 
1 14, and 1 16, and one printer 118. LAN 104 may be coupled to other computer systems 
and/or other devices and/or other LANs 104 through WAN 102. 

30 
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One or more mainfraine computer systems 120 may be coupled to WAN 102. As 
shown, mainframe 120 may be coupled to a storage device or file server 124 and 
mainframe terminals 122a, 122b, and 122c. Mainframe terminals 122a, 122b, and 122c 
may access data stored in the storage device or file server 124 coupled to or included in 
5 mainframe computer system 120. 

WAN 102 may also include computer systems connected to WAN 102 
individually and not through LAN 104 for purposes of example, workstation 1 10b and 
personal computer 1 12b. For example, WAN 102 may include computer systems that 
10 may be geographically remote and connected to each other through the Internet. 

Fig. 2 illustrates an embodiment of computer system 150 that may be suitable for 
implementing various embodiments of a system and method for analyzing and assessing 
documents. Each computer system 150 typically includes components such as CPU 152 

15 with an associated memory medium such as floppy disks 160. The memory medium may 
store program instructions for computer programs. The program instructions may be 
executable by CPU 152. Computer system 150 may further include a display device such 
as monitor 154, an alphanumeric input device such as keyboard 156, and a directional 
input device such as mouse 158. Computer system 150 maybe operable to execute the 

20 computer programs to implement computer-implemented systems and methods for 
analyzing and assessing documents. 

Computer system 150 may include a memory medium on which computer programs 
according to various embodiments may be stored. The term "memory medium" is intended 

25 to include an installation medium, e.g., a CD-ROM or floppy disks 160, a computer system 
memory such as DRAM, SRAM, EDO RAM, Rambus RAM, etc., or a non- volatile 
memory such as a magnetic media, e.g., a hard drive or optical storage. The memory 
medium may also include other types of memory or combinations thereof In addition, the 
memory medium may be located in a first computer which executes the programs or may be 

30 located in a second different computer which connects to the first computer over a network. 
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In the latter instance, the second computer may provide the program instructions to the first 
computer for execution. Also, computer system 150 may take various forms such as a 
personal computer system, mainfi:^e computer system, workstation, network appliance, 
Internet appliance, personal digital assistant ("PDA"), television system or other device. In 
5 general, the term "computer system" may refer to any device having a processor that 
executes instructions from a memory medium. 

The memory medium may store a software program or programs operable to 
implement a method for analyzing and assessing docimients. The software program(s) 

10 may be implemented in various ways, including, but not limited to, procedure-based 

techniques, component-based techniques, and/or object-oriented techniques, among others. 
For example, the software programs may be implemented using ActiveX controls, C-h- 
objects, JavaBeans, Microsoft Foundation Classes ("MFC"), browser-based appUcations 
(e.g., Java applets), traditional programs, or other technologies or methodologies, as desired. 

1 5 A CPU such as host CPU 1 52 executing code and data from the memory medium may 
include a means for creating and executing the software program or programs according 
to the embodiments described herein. 

Various embodiments may also include receiving or storing instructions and/or 
20 data implemented in accordance with the foregoing description upon a carrier medium. 
Suitable carrier media may include storage media or memory media such as magnetic or 
optical media, e.g., disk or CD-ROM, as well as signals such as electrical, 
electromagnetic, or digital signals, may be conveyed via a communication medium such 
as networks 102 and/or 104 and/or a wireless link. 

25 

The systems and methods disclosed herein for analyzing and assessing documents 
may be applied to various kinds of documents that include handwriting and other 
machine-printed information. Documents may be analyzed and assessed for fraud or 
forgery using a profile created for one or more authorized writers of a document. Writers 
30 may include individuals, entities, and/or representatives of entities. Writers may also 
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include machines or devices that print writing for individuals or entities. The profile may 
contain writing characteristics for one or more authorized writers. As used herein, 
"writing" may refer to, but is not limited to characters and symbols formed by an 
individual with an instrument (e.g., pen, pencil, stylus, rubber stamp, etc.) and/or formed 
5 by a machine (e.g., printer, typewriter, etc). As used herein, "handwriting" may refer to 
writing done by an individual with a writing implement, in particular, the form of writing 
peculiar to a particular person. As used herein, "machine-printed writing" may refer to 
writing formed by a machine. Documents may include, but are not limited to, payment 
instruments, receipts, securities dociunents, invoices, account applications, leases, 

10 contracts, credit card receipts and slips, loan applications, credit cards, debit cards, school 
applications, government documents such as social security cards or driver licenses, and 
legal documents such as wills or divorce decrees. As used herein, "forgery" refers to 
falsely and fraudulently making or altering a document. A document may be forged with 
handwriting, a machine, and/or by other means. A forger may make or create an entire 

1 5 document or alter only a portion of a document. For example, a forger may obtain a 
check of an account owner containing no entries and enter information necessary to 
obtain a payment. Altematively, a forger may obtain a check complete with entries of an 
account owner and alter one or more portions of the check. 



20 For example, payment instruments may include various types of commercial paper 

such as a draft. As used herein, a "draft" is an order to pay. Generally, a draft involves 
three parties. One party, the "drawer," orders another party, the drawee (often a bank), to 
pay money to a third party, the "payee," or to a bearer of the draft. A "check" is any draft 
drawn on a bank and payable on demand. Altematively, a payment instrument may 

25 include a "giro." A "giro" is a check-Hke payment instrument commonly used to make 
payments in many European countries. 



In one embodiment, a document may include variable written information and 
stock characteristics. Stock characteristics refer to pre-printed information that tends not 
30 to vary on a particular set of documents. A set of payment instruments for a payment 
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instrument account may include one or more stock characteristics. For example, stock 
characteristics or pre-printed information may include machine-printed text blocks, 
graphics elements (e.g., bank logo), and the relative positions and/or locations of other 
stock characteristics. Machine-printed text blocks may include, for example, the name 
5 and address of one or more account owners and account numbers. Altematively, variable 
written information or writing refers to writing that tends to vary on a particular set of 
documents. The content of variable written information may depend on a particular 
purpose or transaction. For example, for a payment instrument, variable written 
information may include a payee, courtesy amount, date, etc. 

10 

Furthermore, a document may include one or more information fields. In one 
embodiment, an "information field" may be a portion of a document for entering variable 
written information. For example, the one or more accoimt owners of a checking account 
are a set of writers or individuals that may enter written information in various portions of 

15 a check. For instance, an account owner may write his or her signature in the signature 
field of a check corresponding to the accoimt of the account owner. As used herein, an 
"account" refers to a formal business arrangement providing for regular dealings or 
services, such as banking, and involving the establishment and maintenance of an 
account. Written information may be entered into information fields of a check by a 

20 machine, such as a printer. In some embodiments, an "information field" may refer to 
pre-printed information on a document, or a document stock characteristic, such as 
graphic elements or machine-printed text. 

FIG. 3 illustrates an embodiment of a system and method for analyzing and 
25 assessing documents. Document image archive 210 may include an archive of images of 
documents that may include variable written information and/or pre-printed information. 
The document images may be created from valid processed documents that include valid 
written information corresponding to known individuals or writers. The document 
images may also be created fi-om unprocessed and/or forged documents. As used herein, 
30 an "image" is a representation of a graphics image in computer memory. The image may 
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be composed of rows and columns of dots. The value of each dot, e.g., whether it is filled 
in or not, is stored using one or more bits of data. A "bit," short for binary digit, is the 
smallest unit of information on a computer system. 



5 In one embodiment, document image archive 210 may include a payment 

instrument archive that includes images of valid processed payment instruments. In some 
embodiments, the images in the archive may include images of several types of 
documents corresponding to known individuals or entities. For example, the archive may 
include images of checks and images of credit card receipts corresponding to a particular 
10 individual or individuals or entity. The particular individual or individuals or entity may 
be authorized writers. "Authorized writers" generally refer to writers permitted and/or 
with the legal right to make entries on a document, such as one or more account owners 
of a payment instrument account. 

15 For example, a payment instrument archive may be created and stored by 

Check Vision software from Computer Sciences Corporation of El Segundo, Califomia. 
A camera may be used to capture digital images of payment instruments. For example, a 
bank may capture digital images of payment instruments presented for payment. Digital 
images of payment instruments may be archived for analysis. In one embodiment, the 

20 images may be transferred to archive 210 via the Intemet. A database of images of any 
type of document including variable handwritten information, variable machine-printed 
information, and/or pre-printed information may be created and stored on a memory 
medium. 

25 As shown by data flow 218, document images from archive 210 may be provided 

to document analyzer 214. Document analyzer 214 may create a profile that corresponds 
to the writing of one or more individuals or an entity. The profile may also include pre- 
pre-printed information. The profile may be created from digital images of previously 
captured documents in the archive. In one embodiment, document analyzer 214 may be a 
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payment instrument analyzer that creates a payment instrument account profile for an 
account fi-om images of valid processed payment instruments of an accoimt. 

Document analyzer 214 may extract information from one or more images of 
5 documents of writers to create the profile. The profile information may include writing 
characteristics and pattems, data content, semantic pattems, and docmnent layout that 
uniquely characterize the writers and the document. In one embodiment, the document 
profile may include profile information from more than one type of document 
corresponding to known writers. As shown by data flow 220, a profile may be stored in a 

10 profile database 212. In one embodiment, profile database 212 may be a payment 
instrument profile database. In certain embodiments, the database may be stored in 
memory on a computer system. Altematively the database may be stored in memory on 
various types of portable memory media not coupled with a computer system. For 
example, a memory medium may include a computer chip or magnetic strip. The 

15 computer chip or magnetic strip may be coupled with a card (e.g., a credit card, debit 
card, identification card, etc). 

In one embodiment, document 213 may be provided to document analyzer 214, as 
shown by data flow 230. Document 213 may include one or more information fields that 

20 include written and/or pre-printed information. Written information in the one or more 
information fields may be asserted to have been entered by particular writers. Document 
213 may be, for example, an image of a payment instrument that was previously 
presented for payment to a bank. The writers may include one or more account owners. 
Altematively, document 213 may be a bank account application written by an appUcant. 

25 The writers asserted to have entered written information on document 213 may 

correspond to a writing profile that is stored on profile database 212. For example, 
document 213 may be a payment instrument that corresponds to a payment instrument 
accoimt profile. Document analyzer 214 may perform one or more analyses or tests for 
assessing fraud on document 213 using a profile from the profile database 212, as shown 

30 by data flow 222. A document may be fraudulent if it has been altered, written, or created 
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by an individual other than one of the authorized writers for a document, such as one or 
more payment instrument account owners. An individual who fraudulently writes, ahers, 
or creates a document may be referred to as a forger of a document. For example, a 
"forger" of a payment instrument may be an individual who alters or writes a payment 
5 instrument of an account not owned by the forger without the permission of one or more 
of the account owners. In addition, a forger may be an individual who signs a credit card 
slip corresponding to a credit card account not held by the individual. 

The results of the tests or analyses on dociunent 213 may be provided to a fraud 
10 detector 216, as indicated by data flow 224. Fraud detector 216 may assess from the tests 
or analyses whether document 213 is potentially fraudulent. In one embodiment, if 
document 213 is assessed not to be a forgery, the computer system may notify a document 
processing system 217 that the document is valid, as indicated by data flow 226. 
Alternatively, fraud detector 216 may assess that document 213 is potentially fraudulent. 
15 In this case, document 213 may be submitted, as indicated by data flow 228, for further 
review 219. The result of document review 219 may be sent to document processing 217. 
For example, a reviewer may determine that a payment instrument is fraudulent and 
instruct the bank not to make payment. The determination of whether the payment 
instrument is fraudulent may involve further research. For example, a reviewer may 
20 contact the account owner corresponding to the payment instrument. Rejected payment 
instruments may be returned to depositors. 

In an alternative embodiment, document analyzer 214 may extract information 
from document 213 for piuposes other than assessing fraud. For example, text from 
25 document may be recognized and extracted to save labor that would be expended in 
keying in the text. In addition, information may be extracted from document 213 and 
stored in profile database 212. 

Furthermore, the information in the profile may be used for data mining. Data 
30 mining refers to the process of looking for hidden pattems in a group of data. Data 
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mining may be used to find correlations between information fields to predict the content 
of, for example, a payment instrument presented for payment. For instance, data mining 
may be able to predict an entry in one information field of a document based on the entry 
in another. 

5 

In one embodiment, a profile may include specific information relating to one or 
more information fields of a document. The specific information corresponding to each 
information field is generally entered in writing by the writer of the document. Detection 
of handwriting in an information field of a document belonging to someone other than an 
10 authorized writer is evidence of potential fi-aud. For example, handwriting in any 

information field of a payment instrument belonging to someone other than one of the 
account owners is evidence of fi"aud. 



FIG. 4 depicts an illustration of a check that includes handwritten information in 
15 the information fields of the check. Check 264 includes payee field 266, date field 272, 
courtesy amoxmt field 274, legal amount field 268, memo field 270, and signature field 
276. Payee field 266 generally includes the name of an individual or entity. Date field 
272 may include a date after which the check may be paid in terms of a month, day, and 
year. Courtesy amount field 274 may include the amount, for example, in dollars, in 
20 numeric form for which the check is written. Legal amount field 268 may include the 
amount in dollars in alphanumeric form for which the check is written. Memo field 270 
may include any information a writer of the check may desire to enter. A writer may 
enter information in the memo field relating to the purpose of the payment, for example, 
"June Rent." A writer may also enter an accoimt number that corresponds to an account 
25 the writer has with a payee. For example, a writer may enter a writer's account number 
with a utility company or a writer's credit card account number. Signature field 276 
includes the handwritten signature of one of the owners of the account. As used herein, a 
"signature" may be defined as the name of a person written with the person's own hand. 
Label 265 refers to the stock characteristics and/or pre-printed information of the check. 

30 

/ 
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FIG. 5 depicts an illustration of a giro that includes handwritten information in 
information fields of the giro. Giro 278 includes debit account fields 280, amount field 
282, description field 284, credit account field 286, name field 288, city field 290, and 
signature field 292. Giro 278 also includes text 294. Debit account field 280 includes the 
5 number of an account to be debited or charged against to pay the amount for which the 
giro is written. Amount field 282 may include the amount, for example in euros, in 
nimieric form for which the giro is written. Description field 284, like the memo field, 
may include any information a writer of the giro may desire to enter. Credit account field 
286 includes the number of an account to be credited in the amount for which the giro is 
10 written. Name field 288 includes the payee of the giro. City field 290 includes the name 
of the city where the giro credit recipient's bank is located. Signature field 292 includes a 
signature of one of the giro account owners. 

In one embodiment, a writing profile, such as a payment instrument account 
15 profile, may include profiles for one or more of the information fields in a document. 

The profile of the information fields may include writing characteristics and patterns, data 
content, and/or semantic pattems that uniquely characterize the writing entered into 
information fields by particular writers. An information field of a document, such as a 
payment instrument, may include one or more entry types that the writer of the document 
20 may enter in the information field. An entry type refers, for example, to a specific name 
or number that one or more owners of an account enter in a field. For instance, entry 
types of a payee field correspond to payee names to which account owners write checks. 

In an embodiment, a writing profile for a document, such as a payment instrument 
25 account profile, of an entry type of an information field may include one or more 

representations of the entry type. The one or more representations may be referred to as 
writing profile representations. Writing profile representations may include handwriting 
profile representations and machine-printed profile representations. An embodiment of a 
method of generating a payment instrument account profile for an account may include 
30 providing one or more payment instruments written by one or more account owners. At 
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least one of the payment instruments may include at least one inforaiation field. In one 
embodiment, images of writing in at least one information field maybe obtained. The 
payment instrument images maybe obtained fi-om the image archive discussed in FIG. 3. 
The method may further include determining at least one profile representation of firom at 
5 least one of the information fields. 

At least one of the writing profile representations may correspond to at least one 
entry type of at least one of the information fields. At least one variant of the written 
entry type of an information field may be included in the handwriting profile 
10 representations. 

A "variant" refers to a distinct written sample of a type of written information 
such as a character or set of characters. A type of written information may be, for 
example, a letter of an alphabet or a signature. Generally, writing of an writer, such as an 

15 individual, includes writing characteristics and pattems, data content, and/or semantic 
pattems that are unique to the writer. A single sample of an individual's handwriting, for 
example, may not include all the unique properties of the handwriting of an individual. A 
single variant includes at least some of such properties. Variants of a particular type of 
written information, such as a signature, may include a majority of the writing 

20 characteristics and pattems, data content, and/or semantic pattems that are unique to the 
individual. For example, an individual may consistently include a set of strokes in his or 
her signature. However, the individual may not include all such strokes in every 
signature sample. 

25 Furthermore, a method of assessing fi-aud in a document, such as a payment 

instrument, may include performing one or more fi*aud tests. A document may be 
firaudulent if at least some of the writing on the document was not entered by authorized 
writers permitted to make entries on a document, such as one or more account owners of 
a payment instrument accoimt. A docmnent may also be firaudulent if at least some of the 

30 pre-printed information on the document does not approximately match the pre-printed 
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information in a pre-printed information profile. Fraud tests may include analyses of 
writing characteristics and patterns, data content, and/or semantic patterns of entries in 
individual information fields and between information fields of a document such as a 
payment instrument. FIG. 6 depicts a flow chart of a method for assessing fraud in a 
5 document. Assessing firaud in a dociunent may include providing a document to 

computer system, as shown at step 500. Fraud tests may then be performed, as indicated 
at step 502, on one or more of the information fields of the dociunent. At step 504, the 
method may include assessing whether the payment instrument is potentially fraudulent 
based on the results of the one or more fraud tests. 

10 

In an embodiment, a fraud test may include an assessment of whether information 
in an information field of the document approximately matches a writing profile or pre- 
pre-printed information profile (or payment instrument account profile) of the 
information field. Failure of information in an information field to approximately match 

15 a writing profile may be evidence that the information was not made by at least one of the 
authorized writers permitted to make entries on the document (such as an account owner 
of a payment instrument account). Matching information in an information field with a 
profile may refer to comparing the information as a whole to the profile. Matching an 
information may also refer to comparing discrete elements or characteristics of the 

20 information to the profile. Therefore, matching may include a subset of several fraud 
tests. In one embodiment, a fraud test may include analyzing variations among discrete 
elements in information in an information field. Similarly, a fraud test may include 
analyzing a comparison of information in different information fields. Another fraud test 
may include analyzing correlations of information in different information fields. In 

25 addition, a fraud test may include assessing whether information in an information field 
approximately matches a lexicon associated with the information field. 

The result of the fraud tests may provide evidence that a document, such as a 
payment instrument, is potentially fraudulent. The strength of the indication of fraud may 
30 be different for each fraud test. In one embodiment, one or more of the fraud tests may be 
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assigned a fraud weight, such that the fraud weight corresponds to the strength of the 
indication of fraud in the payment instrument. An assessment of whether a document, 
such as a payment instrument, is fraudulent may be based on one or more of the fraud 

tests and the corresponding fraud weights. The assessment may be made in the fraud 
5 detector depicted in FIG. 3 . 

hi an embodiment, when a fraud test indicates fraud, the computer system may 
generate a flag indicating that the document is potentially fraudulent. The flag may 
include a fraud weight that corresponds to a strength of the indication of fraud of the 

10 fraud test. The fraud weight ofa fraud test may depend on a number of factors. For 

example, fraud tests involving features that are consistently present in certain documents, 
such as payment instruments of an account, may receive a greater weight than fraud tests 
involving features present less frequently. For example, variations in writing features that 
are always present may receive a higher weight than variations in features that are 

15 infrequently present. In addition, fraud tests involving fields where fraud is frequently 
perpetrated in payment instruments, such as a courtesy amount field and a legal amount 
field, may receive a higher weight than fraud tests relating to other information fields. 

Table 1 . Summary of Content in a Document Profile and Corresponding Analysis 



Profile Component 


Content 


Analysis Technique 


Information Field 
Content Shape 


Mathematical representations for 
variants of types of written 
information 


Information Field 
Content Shape Analysis 


Information Field 
Content Image 


Images for variants of types of written 
information 


Information Field 
Content Image Analysis 


Digit/Alpha 


Mathematical representations for 
variants of a letter type and a numeral 
type 


Digit/Alpha Analysis 


Symbology 


Mathematical representations for 

variants of a symbol type 

Images for variants of a symbol type 


Symbology Analysis 


Syntax Pattem 


Elements and ordering of elements in 
specific information fields 


Syntax Pattem Analysis 


Lexicon 


List of the names that have previously 


Lexicon Analysis 
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been recognized on documents 
associated with a set of individuals 
and/or accounts 




Document Stock or 

Pre-printed 

Information 


Representation of the nature and 
location of the graphic elements and 
machine-printed text that appear on a 
document associated with a set of 
individuals and/or accounts 
Mathematical representations for 
variants of types of pre-printed 
information, including font type, 
information 

Images for variants of types of 
machine-printed information, 
including font type 


Document Analysis 


Information Field 
Cross Correlation 


Table listing the cross field 
relationships of interest in a document 
associated with a set of individuals 


Cross Field Matching 
Analysis 



Table 1 provides a summary of components of content (i.e., variable writing and 
pre-printed information) in a document profile, such as a payment instrument account 
profile, and analysis techniques according to one embodiment. The analysis techniques 
5 may be appUed to the corresponding profile contents to assess fi-aud in a document such 
as a payment instrument. The information field content shape profile may include 
mathematical representations for variants of types of written information. The 
information field content image profile may include images for the variants of types of 
written information. The writing profiles may include representations of at least one font 

10 style of machine-printed writing. The digit/alpha profile may include both mathematical 
representations and images for the variants of characters such as letters and numerals. 
The symbology profile may include mathematical representations and images of variants 
of symbols that appear in the information field of a document such as a payment 
instrument (e.g., a in the legal amount field). The syntax pattern profile may include a 

15 list with elements and an order of the elements in specific information fields. For 

example, a syntax pattern profile may include the variants of the form of a month, day, 
year, and punctuation and the order of the month, day, and year and punctuation in the 
date field. In addition, the lexicon profile may include a list of names that have 
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previously been recognized for an account in a particular information field, such as payee 
names in a payee field. The document stock or pre-printed profile may include 
representations of pre-printed information such as graphic elements (e.g., bank logos) and 
machine-printed text (e.g., name and address of account owners) that appear on a 
5 document, such as a payment instrument. The document stock profile may also include 
mathematical representations and/or images of machine-printed text. The information 
field cross correlation profile may include a list of cross field relationships that may occur 
with a particular fi-equency in a document associated with particular writers, such as in a 
payment instrument of an accoxmt. For example cross field relationships may include: 
10 account number in a memo field to payee name, payee name to legal and courtesy 
amoimt, and identity of check writer from the signature field to syntax pattems and 
symbology in other fields. 

Handwriting and/or writing may include, but is not limited to a mathematical 
15 representation and/or an image. Handwriting and/or writing may also include, but is not 
Hmited to at least one type of handwritten and/or written information such as a word type 
and/or character type. Handwriting and/or writing may further include, but is not limited 
to a global feature of handwriting and/or writing, a local feature of handwriting and/or 
writing, a syntax pattem, and/or a lexicon name for an information field. A handwriting 
20 and/or writing profile representation may include, but is not limited to at least one of the 
types of handwriting and/or writing profiles described in Table 1 . 



Table 2. Summary of Content Analysis for Payment Instrument. 



Information Field 


Content Analysis 


Pre-printed 
information 


Matching of all preprinted information including machine- 
printed text, logos, line and other graphic elements 
Font Matching 


Courtesy Amount 


Individual character analysis 

Character pattems surrounding the courtesy amount 

Symbology used in writing the cents amount 


Legal Line - Dollar 
Amount 


Global handwriting features 
Word matching 
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Individual character analysis 

Symbology connecting the dollar and cent content 

Character pattems surrounding the legal amount 


Legal Line - Cents 
Amount 


Punctuation 
Individual digits 


Payee 


Global writing features 
Words 

Individual characters 

Lexicon matching to generic lists of payees (e.g., common 

Lexicon matching to suspicious payees (payee names frequently 
involved in transactions with high fraud risk) 
Matching to ASCII Hst of payees common for the account 
Matching to handwriting of payees common for the account 


Signature 


Global writing characteristics 
Word matching 


Memo 


Global writing characteristics 
Individual character analysis 


Date 


Pattems 

Individual character analysis 


Endorsement 


Matching of endorsement to payee 



Table 2 describes a summary of various embodiments of analysis for information 
fields of a payment instrument. The profiles described in Table 1 may be applied to 
assess a payment instrument using the analysis simmiarized in Table 2. 

5 

Profile representations of variable writing and pre-printed information (e.g., 
machine-printed text) may be stored in memory on a computer system in terms of 
mathematical representations. In an embodiment, a writing profile, such as a payment 
instrument account profile, may include one or more mathematical representations of 
10 variable writing and/or pre-printed information. The mathematical representations may 
include one or more variants of an entry type of an information field. As noted above, a 
variant refers to a distinct version of a type of written information. For example, the 
appearance of a handwritten signature of an individual, such as an account owner, may 
tend to vary, even within a short time period. The account owner may have several 
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distinct versions of his or her signature. The one or more mathematical representations in 
a signature profile for an account owner correspond to one or more of the variants of the 
signature, hi other embodiments, a writing profile, such as a payment instrument account 
profile, in memory on a computer system may include one or more images. In a similar 
5 manner, one or more of the images may include images of one or more variants of an 
entry type of an information field. 

In one embodiment, mathematical representations of writing maybe expressed as 
feature vectors. For example, U.S. Patent Nos. 6,157,731 to Hu et al., 6,084,985 to 

10 Dolfing et al., 5,995,953 to Rindtorff et al, 5,828,772 to Kashi et al., and 5,680,470 to 
Moussa et al., which are incorporated by reference as if fiiUy set forth herein, disclose the 
use of feature vectors to represent handwriting. Feature vectors are vectors that may 
include one or more writing features that characterize writing as elements of the vectors. 
For example, features included in a feature vector may represent the strokes that make up 

15 written information. Since writing has a local character and a global character, both local 
features and global features may characterize writing. Global features describe general 
characteristics of writing. Consequently, global features may be discernible in all 
information fields of a document such as a payment instrument. Global features in a 
profile for an information field may be the result of combining global features from 

20 written information in several information fields. Global features may include global 
slant, tangent entropy, global thickness of stroke (pen thickness), and curvature entropy. 

"Entropy" refers to a way of measuring information content. Something that is 
very predictable has an entropy of zero, while something having little or no predictability, 
25 has maximum entropy. The more predictable writing appears, the lower its entropy. 

Handwriting strokes may be parallel or may have several directions. Handwriting that is 
predictable may be composed of strokes in relatively regular pattems. On the other hand, 
a signature with many vertical and horizontal strokes may have higher entropy. 
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Curvature versus tangent entropy refer to a measure of changes in curves and 
stroke directions, respectively. The style of writers may be classified according to their 
entropies. The entropy, H(i), of stroke i may be calculated by H(i) = -pi log(pi), where pi 
is the probability of stroke i. 

5 

Local features refer to characteristics specific to a particular writing sample. 
Local features may include local slant, leading and trailing tail shape and direction, 
topology of a digit, stroke distribution (e.g., proximity of strokes, height, width), local 
thickness of stroke (to evaluate shakiness in handwriting), alpha versus numeric pattems 
10 (e.g., date), punctuation, symbols, '00' shape in amounts, and 'xx/100' structures in the 
legal amount. 

Furthermore, topological vectors may represent the relationship between strokes. 
The topological vectors may include information about the location of strokes with 
15 respect to one another, for example, tangent and direction vectors. 

Li an embodiment, one or more of the variants of types of written information of 
an information field of a writing profile may include a majority of the features that 
characterize the writing of authorized writers, such as one or more account owners. 
20 Stable features refer to features that tend to appear consistently in writing in different 
samples of a writer. Stable features may tend to be more significant in fi-aud assessment 
and recognition than weak features. Weak features refer to features that tend not to appear 
consistently in writing samples. Furthermore, writing in the information fields of a 
document, such as a check or giro, may be converted into a mathematical representation. 

25 

FIGs. 7, 8, 9, and 10 illustrate features that may be included in mathematical 
representations of information or entries in information fields of a payment instrument. 
FIG. 7 represents nine variations of a signature entry in a signature field. Although some 
variation exists in the signature entries, bottom left stroke 364 and top right stroke 366 
30 are consistently present. Such features may be considered to be stable features. FIG. 8 
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represents entries from a payee field. Several characteristics are consistent in each of 
entries 324-330. For example, the characters tend to be upright with block letters. Also, 
the character shapes, such as that of the 'C, are consistent. FIG. 9 represents entries from 
a courtesy amount field. The courtesy amount entries also exhibit several consistent 
5 features. These include the cents '00' and non '00' features, as shown by entries 332 and 
334. Digit shapes are also consistent, for example, the '4's, as shown by entries 333 and 
334 and the '2's, as shown by entries 335 and 336. FIG. 10 represents legal amount 
entries in a legal amount field. The legal amount entries include dollar amoxmt 348, 
symbol ('+') 350, and symbol ('xx/100') 349. The dollar amount and symbols are 
10 consistent among the various entries. 

An embodiment of a method of generating a writing profile for information fields 
of a document is depicted by a flow chart in FIG. 11 . In an embodiment, one or more of 
the documents may be a payment instrument. The information fields may include 

15 handwriting of one or more account owners of a payment instrument account. The 

method may include providing one or more documents to the computer system, as shown 
in step 526 of FIG. 1 1. In one embodiment, at least one of the documents may include at 
least one information field. In other embodiments, at least one of the documents may 
include at least two information fields. One or more of the documents may be provided 

20 by accessing at least one image from a database in memory on a computer system. At 
least one of the documents may be a valid document. In certain embodiments, the 
method may include obtaining images of writing in the information fields. The method 
may further include determining at least one writing profile representation for at least two 
information fields, as shown in step 528. Determining at least one writing profile 

25 representation may use writing from at least one of the information fields of the 
documents. At least one of the writing profile representations may be stored on a 
memory medium on a computer system. 

In an embodiment, at least one writing profile representation may be assessed for 
30 at least two of the information fields using writing from at least one of the information 
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fields. Alternatively, at least two writing profile representations may be assessed. In 
some embodiments, writing may be used fi^om at least two information fields. 

In other embodiments, at least two writing profile representations may be assessed 
5 for at least one of the information fields using writing from at least one of the information 
fields. Altematively, at least two writing profile representations may be assessed for at 
least one of the information fields. In some embodiments, writing may be used fi"om at 
least two information fields. 

10 In some embodiments, at least one writing profile representation may be assessed 

for at least one of the information fields using handwriting fi-om at least two of the 
information fields. Altematively, at least two writing profile representations may be 
assessed. In another embodiment, at least one writing profile representation may be 
assessed for at least two of the information fields. Some embodiments may fiirther 

15 include determining at least two writing profile representations for at least two of the 
information fields using writing fi:om at least two information fields. 

The method may fiirther include determining mathematical representations of 
writing in one or more of the information fields. In some embodiments, determining 
20 mathematical representations of writing may involve converting images of writing in at 
least one of the information fields to mathematical representations. 

At least one of the writing profile representations may include mathematical 
representations of the writing in the information fields. In an embodiment, determining 

25 writing profile representations may involve determining variants of the mathematical 
representations of writing. In an embodiment, at least one writing profile representation 
may include at least one writing variation (variant) of an example of at least one type of 
written information. For example, profile representations, belonging to the Digit/Alpha 
profile described in Table 1, may include variants of the handwritten letter "a." The 

30 writing profile representations may also include variants of one or more entry types in the 
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one or more information fields. In an embodiment, at least one of the writing profile 
representations may be an image. In other embodiments, at least one of the writing 
profile representations may be a mathematical representation. 

5 The conversion of an image to a mathematical representation expressed as feature 

vectors is disclosed in U.S. Patent Nos. 6,157,731 to Hu et al., 6,084,985 to Dolfing et al., 
5,995,953 to Rindtorff et al., 5,828,772 to Kashi et al., and 5,680,470 to Moussa et al, 
which are incorporated by reference as if fiilly set forth herein. For example, one method 
may include first converting an image to a runlen image. A "runlen image" refers to a 

10 raster scan image that is represented by black (where the text is) and white runs (where 
the background is). A runlen image has N lines (in a raster scan) and each line is 
represented by a variable number of runs (white-black-white-black-...). A run may be 
represented by a start point and its length. The runlen image may then be converted to a 
Freeman chain code. A "Freeman chain code" is an image representation that has an 8- 

15 direction code that provides a contour of an object in an image, for example, a letter. The 
Freeman chain code may then be converted to a tangent, which may be converted to 
curvature. The curvature may then be converted to a handwriting or writing stroke. 

In an embodiment, one or more variants among the mathematical representations 
20 may be assessed using a clustering algorithm. Clustering algorithms are methods of 
grouping large sets of data into clusters of smaller sets of similar data. The goal of a 
clustering algorithm is to reduce an amount of data by categorizing or grouping similar 
data items together into a "cluster." A clustering algorithm finds natural groups of 
components (or data) based on some similarity. In particular, a clustering algorithm may 
25 determine groups of mathematical representations based on similarity of writing features. 
The clustering algorithms that assess the variants of a type of written information may use 
curvature and tangent profile matching (entropy measure), dynamic warping/matching, 
and K-nearest neighbor techniques. 
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FIG. 12 is an illustration of determining variants for a handwriting profile fi*om 
handwriting samples. Handwriting samples 312 depict six different samples of a 
signature fi-om the signature field of a check. The signature samples are very consistent 
with little variation from one sample to another. Therefore, only one variant may be 
5 assessed to characterize the signature of the account holder (e.g., variant 3 1 8). 

Handwriting samples 314 depict seven samples fi-om the city field of a giro. There is also 
little variation in the samples of the city name. In this case, variant 320 may be assessed 
to characterize the city name. In addition, handwriting samples 315 depict samples of the 
payee line of a check. A small amount of variation is exhibited between the samples. 
10 Variant 317 may be assessed to characterize the samples. Handwriting samples 316 
depict seven samples of a legal amount fi-om a check. In this case, there are three entry 
types with respect to the dollar amount: four samples with "one hundred dollars" and one 
sample each for "two," "three," and "four hundred dollars." Only one variant of the "one 
hundred dollars" samples may be necessary. 

15 

As described herein, a writing profile may be assessed firom more than one sample 
of writing, as shown in FIG. 12. It is advantageous to base a writing profile on more than 
one sample due to variations in writing of the authorized writers. A greater variation in 
writing of the authorized writers may require a greater number of samples to characterize 
20 the writing with a writing profile. 

Variation in handwriting may be both inherent and dynamic. Dynamic variation 
refers to the change in an individual's handwriting over time, for example, over months 
and years. For example, the variation of a date from a date field and a signature from 

25 checks written over a period of months is illustrated in FIG. 13. Samples 298, 300, 302, 
304, and 306 represent a variation from May to September of the year 2001. In this case, 
the dynamic variation for both the date and the signature is apparent. The appearance of 
the signature varies between May and September. In addition, there is a change in the 
syntax of the date field from an alphabetic month to a numeric month between May and 

30 September. 
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Inherent variations refer to variations in handwriting that are independent of time 
or that may occur over a short period of time, for example, over days. The source of 
inherent variations may be inconsistent handwriting of an individual. For instance, an 
5 individual may consistently write a signature two or more ways. 

In one embodiment, a method of generating a handwriting profile, such as a 
payment instrument account profile that takes into account inherent variations is depicted 
in FIG. 14. In step 506, one or more documents may be provided to a computer system. 

10 The documents may be provided from a database corresponding to a document archive 
shown in FIG. 3. The documents may be vahd documents submitted by authorized 
writers. The documents may take into accoimt variations in the handwriting of the 
authorized writers only up to the date of the latest submitted document in the database. 
At least one handwriting profile representation may be assessed from one or more of the 

15 documents, as shown in step 508. The handwriting profile representations may be, for 
example, images and/or mathematical representations. One or more of the handwriting 
profile representations may then be stored in memory on a computer system, as shown in 
step 510. Altematively, at least one of the handwriting profile representations may be 
stored on various types of portable memory media not coupled with a computer system. 

20 For example, the handwriting profile representations may be stored in a handwriting 
profile database depicted in FIG. 3. 

In an embodiment, a method of generating a handwriting profile on a computer 
system that accounts for dynamic variations in handwriting is depicted in FIG. 15. As 

25 shown in step 517, the method may include providing one or more documents to the 
computer system. At least one of the documents may include at least one information 
field. In some embodiments, the documents may include at least two information fields. 
At least one of the documents may be a vaUd document, such as a vaUdated payment 
instrument. In an embodiment, the one or more documents may be provided to the 

30 computer system by providing images of the document to the computer system. At least 
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one handwriting profile representation may be assessed for at least two of the information 
fields using the handwriting fi^om at least one of the information fields, as indicated at 
step 518. Altematively, at least one handwriting profile representation may be assessed 
for at least one of the information fields using the handwriting fi-om at least two of the 
5 information fields. In other embodiments, at least two handwriting profile 

representations may be assessed for at least one of the information fields using the 
handwriting fi*om at least one of the information fields. In an embodiment, the 
handwriting profile representations may be stored on a memory medium on a computer 
system. 

10 

In some embodiments, the method may fiirther include providing one or more 
additional documents to the computer system, a shown by step 519. At least one of the 
additional documents may include at least one information field. At step 521, the method 
may include updating at least one of the handwriting profile representations using at least 

15 one of the information fields of at least one of the additional documents. Altematively, 
updating may also use at least one of the information fields of at least one of the 
documents. In certain embodiments, updating at least one of the handwriting profile 
representations may include modifying at least one handwriting profile representation, 
deleting at least one handwriting profile representation, and/or determining at least one 

20 handwriting profile representation. 

The additional documents may be payment instruments presented for payment to a 
bank that have been validated. The computer system memory may then be updated with 
the handwriting profile representations obtained from the additional documents. 
25 Consequently, the profile may be periodically updated to take into account the dynamic 
variation of the handwriting of the one or more account owners. 

A writing profile, such as a payment instrument account profile, as described 
herein, may be applied to assess fi-aud in documents, such as payment instruments 
30 presented for payment to a bank. Methods for assessing fraud in a document, such as a 
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payment instrument, may require methods for recognizing characters or text on a 
document. "Recognizing," as used herein, refers to the process of identifying elements of 
written information, such as numerals, letters, and symbols, from a representation of the 
characters. The representation may be an image or mathematical representation, for 
5 example. Elements of written information may be identified fi-om mathematical 
representations fi-om feature vectors. 

Written information in an image representation may be identified by converting 
the image into a computer processable format, such as ASCII. "ASCII" is an acronym for 
10 the "American Standard Code for Information Interchange." ASCII is a code for 

representing Enghsh characters as numbers, with each letter assigned a number firom 0 to 
127. Most computer systems use ASCII codes to represent text to enable transfer of data 
from one computer to another. 

15 Several products are commercially available for recognition of written 

information in images. For example, Checkscript and Quickstrokes are character 
recognition software products from Mitek Systems of San Diego, California. In addition, 
Checkplus 2.0 is character recognition software provided by Parascript of Niwot, 
Colorado. A2iAofNew York, NY provides CheckReader™. The Corroborative Image 

20 Character Recognition (CICR) System may be obtained from Computer Sciences 
Corporation of El Segundo California. Gaussian Probabilistic Distribution 
(GPD) software may be obtained from Malayappan Shridhar of the School of Engineering 
at the University of Michigan at Dearborn, Dearbom, Michigan. 

25 hi one embodiment, the information field content shape profile, referred to in 

Table 1, may include at least one mathematical representation of writing on a computer 
system. At least one of the mathematical representations may represent writing of 
authorized writers, such as one or more account owners. In an embodiment, 
mathematical representations may include one or more entry types of an information field 

30 of a document. The mathematical representations characterize writing of the authorized 
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writers. The mathematical representations may be represented in terms of feature vectors, 
as described herein. In an embodiment, at least one of the mathematical representations 
may include at least one variant of an entry type of an information field. 

5 According to one embodiment, an information field content shape profile may be 

generated for any information field of a document that includes writing. For example, 
information field content shape profiles of a checking account (see FIG. 4) may be 
generated for payee names of a payee field, dates in a date field, amoimts in a courtesy 
amount field, amounts in a legal amount field, descriptions in a memo field, and a 
10 signature in a signature field. Additionally, information field content shape profiles of a 
giro accoimt (See FIG. 5) may be generated for account numbers in a debit account field, 
amounts in an amount field, descriptions in a description field, account numbers in a 
credit accoimt field, names in a name field, city names in a city field, and signatures in a 
signature field. 

15 

Fraud may be assessed in a document by comparing written information in an 
information field of the document to a writing profile, such as a payment instrument 
account profile. According to one embodiment, a method of comparing written 
information to a writing profile using a computer system may include providing the 

20 written information from a docvmient to the computer system. The written information 
may be in the form of a mathematical representation that includes one or more sample 
features. Furthermore, at least one writing profile representation may be stored in 
memory on a memory medium. At least one writing profile representation may include at 
least one mathematical representation. At least one mathematical representation may 

25 include one or more profile features. In an embodiment, the sample features and the 
profile features may include both global features and local features. 

The method may fiirther include assessing non-matching features from a 
comparison of the sample features and profile features. In some embodiments, the non- 
30 matching features may be associated with fraud weights. 
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"Match" refers to a degree of similarity between samples of written information. 
For example, U.S. Patent Nos. 5,995,953 to RindtorfiFet al, 5,828,772 to Kashi et al., and 
5,710,916 to Barbara et al., which are incorporated by reference as if fully set forth 
5 herein, disclose methods that include assessing a degree of similarity between samples of 
handwritten information based on a comparison of the feature vectors of the samples of 
handwritten information. 

According to one embodiment, determining whether a sample of written 
10 information matches a profile includes both "global matching" and "local matching" of 
features. Generally, global matching refers to assessing whether written information may 
belong to a set of individuals, such as one or more accoimt owners, based on global 
characteristics. In global matching, global features, such as slant, tangent, and curvature 
entropy, in feature vectors of samples may be compared to assess whether features match. 
15 Global matching may be applied, for example, in assessing whether a payee name entry in 
a payee field and a legal amount entry in a legal amount field were written by the same 
person. 

Furthermore, local matching refers to assessing whether two samples of written 
20 information correspond to the same character, word, or set of words and characters. 

Local matching may be applied, for example, in assessing whether a signature was written 
by an account owner. The signature may be compared to writing profile representations 
of signatures of one or more account owners. Local matching may employ mathematical 
techniques such as K-nearest neighbor and neural networks. For instance, in the case of a 
25 neural network appHed to the legal amount field, a sample of written information and all 
of the variants of "One Hundred" may be converted into feature vectors. A neural net 
may then be trained using a standard back propagation training algorithm to assess 
whether the sample of written information matches at least one of the variants. 
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In certain embodiments, the writing profile may be used to assess a document, 
such as a payment instrument that is presented to a bank for payment. A method depicted 
in FIG. 16 of assessing a document may include providing a document to the computer 
system, as shown at step 418. The document may include at least one information field. 
5 In another embodiment, the document may include at least two information fields. The 
method may fiirther include comparing writing in at least two of the information fields of 
the document to at least one writing profile representation, as shown at step 420. At least 
one writing profile representation may be fi'om at least one information field of at least 
one other document. In an embodiment, at least the one other document may be a valid 

10 document. In another embodiment, the method may include comparing writing in at least 
one of the information fields of the document to at least one writing profile representation 
from at least two information fields of at least one other document. Alternatively, writing 
in at least one of the information fields of the docimient may be compared to at least two 
writing profile representations from at least one information field of at least one other 

15 document. 

As depicted in step 422, fraud in the document may be assessed using at least one 
of the comparisons. In some embodiments, evidence of fraud may include a failure of at 
least a portion of the writing in at least one of the information fields of the document to 
20 approximately match at least one writing profile representation. Altematively, evidence 
of fraud may be a failure of at least a portion of the writing in at least two of the 
information fields of the document to approximately match at least one writing profile 
representation. 

25 In certain embodiments, the information field content shape profile may be used 

to assess fraud in a document, such as a payment instrument that is presented to a bank 
for payment. A method depicted in FIG. 17 may include obtaining at least one 
mathematical representation of the writing from information fields of the document, as 
shown at step 540. In an embodiment, mathematical representations may be obtained by 

30 converting images of the written information. At least one of the mathematical 
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representations may correspond to an example of a type of written information and/or an 
entry type of an information field. The method may further include providing access to a 
computer system that includes a writing profile, as shown by step 542. In an 
embodiment, the writing profile may include at least one writing profile representation of 
5 writing from one or more valid documents. At least one writing profile representation 
may correspond to at least one variant of a type of written information and/or an entry 
type of an information field. At step 544, at least one of the mathematical representations 
of the handwriting may be compared to one or more of the handwriting profile 
representations to assess whether the written information approximately matches the 
10 profile. If the written information does not approximately match the information field 
content shape profile, the computer may generate a flag indicating that the docimient is 
potentially fraudulent. 

In one embodiment, the information field content image profile, referred to in 
15 Table 1, may include at least one image of writing on a computer system. At least one of 
the images may correspond to writing of authorized writers, such as one or more account 
owners. In an embodiment, at least one of the images may correspond to one or more 
entry types of an information field of a document. At least one of the images may 
characterize the writing of authorized writers, such as the one or more accoimt owners. In 
20 an embodiment, at least one of the images may correspond to at least one variant of an 
example of a type of written information or an entry type of an information field. 
According to one embodiment, as described in reference to the information field content 
shape profile, an information field content image profile may be generated for at least one 
information field of a document. 

25 

In some embodiments, the information field content image profile may be used to 
assess fraud in a document, such as a payment instrument that is presented to a bank for 
payment. A method depicted in FIG. 18 may include obtaining at least one image of 
, writing from information fields of the docimient, as shown at step 546. At least one 
30 image may correspond to examples of types of written information and/or entry types of 
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one or more of the information fields. The method may further include providing access 
to a computer system that includes a writing profile, as shown by step 548. In an 
embodiment, the writing profile may include at least one handwriting profile 
representation firom one or more documents. At least one writing profile representation 
5 may correspond to at least one variant of a type of written information and/or an entry 
type of an information field. At step 550, at least one of the images of the writing may be 
compared to at least one writing profile representation to assess whether the writing 
approximately matches the writing profile. If the writing does not approximately match 
the information field content image profile, the computer may generate a flag indicating 
1 0 that the document is potentially fi-audulent. 

Assessing a degree of similarity or matching of images of written information may 
be performed by several methods. These methods determine the degree of similarity 
between two images of a type of written information such as a type of character or set of 
15 characters, for example, a signature. U.S. Patent No. 6,249,604 to Huttenlocher et al., 
which is incorporated by reference as if fully set forth herein, discusses one such method 
based on the technique of dynamic warping. U.S. Patent No. 6,157,73 1 to Hu et al., 
which is incorporated by reference as if fully set forth herein, describes another such 
method that uses hidden Markov models. 

20 

Computer software that determines a degree of similarity of images of a type of 
written information may be obtained commercially. Glory Signature Verification 
Software (GSVS) fi"om Glory Ltd. HIMEJI, HYOGO, Japan deteraiines, with a degree of 
certainty, whether the same individual wrote two images of a type of handwritten 
25 information. 

FIGs. 18 and 19 illustrate assessment of fraud in the signature field of a giro. In 
FIG. 19, illustrations 438 include a set of samples of handwritten information that 
correspond to handwriting profile representations for the signature field of a giro account. 
30 The samples of handwritten information may be stored as mathematical representations in 
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an information field content shape profile. Alternatively, the samples of handwritten 
information may be stored as images in an information field content image profile. 
Samples 440, 442, 444, and 446 are signatures fi-om different giros. A comparison of 
samples 440-446 to the profile representations 438 may indicate that it is likely that each 
5 of the samples is fi-audulent. 

Furthermore, in FIG. 20 illustrations 448 include a set of samples of handwritten 
information that correspond to profile representations for the signature field of another 
giro account. Sample 450 is a signature fi-om a giro. Sample 450 may likely be assessed 
10 to be a fraudulent signature based on a comparison with illustrations 448. 

FIG. 21 is an illustration of fraud assessment in the courtesy amount field of a 
check of a checking account. Samples 452 represent entries in a courtesy amount field 
fi-om valid checks of the checking account. Sample 454 represents a courtesy amoxmt 
15 field of a check to be validated. Information field content shape analysis may likely flag 
sample 454 as potentially fraudulent due to the raised '00' and the style of the '5.' 

FIG. 22 is an illustration of fraud assessment in a giro. In FIG. 22, the pen 
thickness of the signature in signature field 436 is significantly thinner than the entries in 
20 the other fields, for example, amoxmt field 434. Therefore, the giro may likely be flagged 
as potentially fraudulent. 

FIG. 23 is an illustration of fraud assessment in the city field of a giro. Samples 
456 are entries for a city field from valid giros of an account. Information field content 
25 shape analysis may recognize sample 458 as the same city as the entries in samples 456. 
However, information field content shape analysis may likely demonstrate that the 
handwriting is different than the valid entries. As a result, the giro may likely be flagged 
as potentially fraudulent. 
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In one embodiment, the digit/alpha profile, referred to in Table 1, may include one 
or more sets of written characters on a computer system. The one or more sets may 
correspond to one or more character types. In addition, the one or more character types 
5 may correspond to one or more types of numerals. The one or more character types may 
also correspond to one or more types of letters of an alphabet. A set of written characters 
may include at least one variant of a written character type. The variants of a character 
type may characterize the writing features of the character type of authorized writers, such 
as one or more account owners. For example, a set of handwritten '3's may represent 
10 variations in the way an account owner writes a '3.' In an embodiment, the written 
characters in the one or more sets may be stored as mathematical representations, as 
described herein, on a memory medium. Alternatively, the written characters in the one 
or more sets may be stored as images. 

15 In certain embodiments, the digit/alpha profile may be used to assess fraud in a 

document, such as a payment instrument that is presented to a bank for payment. A 
method depicted in FIG. 24 may include obtaining one or more samples of the writing, as 
shown at step 552. One or more of the samples may include one or more images. In an 
embodiment, the handwriting in the information fields may include one or more written 

20 characters. The method may further include, as shown by step 554, recognizing one or 
more written characters in one or more images of the writing in the information fields. 
The written characters may correspond to at least one character type. The method may 
further include providing access to a computer system that includes a writing profile, as 
shown by step 556. In an embodiment, the writing profile may include one or more 

25 writing profile representations from at least one other documents. The at least one other 
document may be a valid document. In an embodiment, the one or more writing profile 
representations may include at least one variant of a type of written character. At step 
558, one or more of the written characters may be compared to at least one profile 
representation of written characters to assess whether the written characters 

30 approximately match the writing profile. If one or more of the written characters do not 
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approximately match the digit/alpha profile, the computer may generate a flag indicating 
that the document is potentially fraudulent. 

In an alternative embodiment, the method may include converting one or more of 
5 the images of the written information to one or more mathematical representations, as 
described herein. The one or more mathematical representations may include one or 
more mathematical representations of written characters. One or more written characters 
may be recognized from the one or more mathematical representations. The method may 
further include comparing the mathematical representations of written characters to at 
10 least one writing profile representation of written characters to assess whether the written 
characters approximately match the writing profile. 

FIG. 25 is an illustration of converting a character in a handwriting image to a 
mathematical representation. Set of images 352 represent several variations of a 
15 handwritten '3.' At step 354, the character may be recognized using character recognition 
software. The strokes of the character may then be analyzed at step 356. Li this case, the 
nimieral '3' includes two strokes: upper cusp 360 and lower cusp 362. The shape of the 
character may be classified at step 358 using neural net or k-nearest neighbor techniques. 

20 FIG. 26 illustrates assessment of fraud in a numeric information field of a 

payment instrument using the methods described herein. List 400 includes a digit/alpha 
profile for a payment instrument account for numeric characters from zero to nine. The 
profile includes variants of the account for each numeric type. Sample 402 corresponds 
to a numeric entry from a field of a payment instrument. Nimierals 404 were recognized 

25 as the numeral '8' in the sample. The '8's do not appear to match the variants 406 of the 
numeral '8' in the profile. 

In one embodiment, the symbology profile, referred to in Table 1, may include 
one or more written symbols on a computer system. The one or more written symbols 
30 may correspond to one or more symbol types. For example, the one or more symbol 
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types may include, for example, one or more types of punctuation marks and/or a At 
least one of the written symbols may include at least one variant of a type of written 
symbol. The variants of a symbol type may characterize the writing features of the 
symbol type of designated individuals, such as one or more account owners of a payment 
5 instrument account. For example, one or more handwritten '+'s represent variations in 
the way an account owner may write a in the legal amount field. In an embodiment, 
the written symbols in the symbology profile may be stored as mathematical 
representations, as described herein, on a memory medium. Alternatively, the written 
symbols in the symbology profile may be stored as images. FIG. 10 illustrates symbology 
10 in handwriting samples in the legal amount field. The symbol types in the legal amount 
field include symbol 350, a and symbol 349, '00/100'. 

In certain embodiments, the symbology profile may be used to assess fi'aud in a 
document, such as a payment instrument that is presented to a bank for payment. A 

15 method depicted in FIG. 27 may include obtaining one or more samples of the writing, as 
shown at step 560. One or more of the samples may include one or more images. In an 
embodiment, the writing in the information fields may include one or more written 
symbols. The method may further include, as shown by step 562, recognizing one or 
more written symbols in one or more images of the writing in the information fields. The 

20 written characters may correspond to one or more symbol types. The method may further 
include providing access to a computer system that includes a writing profile, as shown 
by step 564. In an embodiment, the writing profile may include one or more writing 
profile representations fi'om one or more documents. In an embodiment, one or more of 
the handwriting profile representations may include at least one variant of types of written 

25 symbols. At step 566, one or more of the written symbols may be compared to one or 
more of the writing profile representations of written characters to assess whether the 
written symbols approximately match the profile. If one or more of the written symbols 
do not approximately match the symbology profile, the computer may generate a flag 
indicating that the document is potentially fi-audulent. 
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In an alternative embodiment, the method may include converting one or more of 
the images of the writing to one or more mathematical representations, as described 
herein. One or more of the mathematical representations may include one or more 
5 mathematical representations of written symbols. One or more written symbols may be 
recognized from one or more of the mathematical representations. The method may 
further include comparing the mathematical representations of written symbols to one or 
more of the writing profile representations of written symbols to assess whether the 
written symbols approximately match the profile. 

10 

FIG. 28 is an illustration of assessment of fraud for a check of a checking account. 
Sample 392 represents an entry in a legal amount field of a check to be vaUdated. 
Samples 394 represent entries in the legal amount field from valid checks from the 
checking account. Sample 392 may be determined to be potentially fraudulent by the 
15 methods described herein. First, information field content shape analysis may 

demonstrate that the writing of the dollar amount of sample 392 does not match samples 
394. In addition, there are several differences in symbology. For example, line 391 is 
different than line 393. The 'xx/100' symbol differs as shown by comparing symbol 397 
and symbol 399. Samples 394 include line 395, which is absent from sample 392. 

20 

In one embodiment, the syntax pattem profile, referred to in Table 1, may include 
at least one syntax pattem. A syntax pattem may include one or more elements. The one 
or more elements in a syntax pattem may be in a specific order. For example, the entries 
in the date field of a check may include a month of the year, a date of the month, a year, 
25 and punctuation marks. At least one syntax pattem in the writing profile may include at 
least one variant of a syntax pattem for an information field. For instance, at least one 
variant may be the manner that one or more account owners enter a date in the date field. 
For example, a date may be written several ways: 2/14/01, 2-14-01, Febmary 14, 2001, 
and 14 February 01. According to one embodiment, elements of the date field may 
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include: a numeric month, an alphabetic month, a numeric date of the month, a two-digit 
year, a four-digit year, a comma, a forward slash, and a dash. 

In one embodiment, the syntax pattern profile may be used to assess fraud in a 
5 document, such as a payment instrument that is presented to a bank for payment. A 
method depicted in FIG. 29 may include obtaining written information in an information 
field, as shown at step 568. The written information may include one or more elements. 
In some embodiments, the written information may be an image and the method may 
include recognizing one or more of the elements in the image. One or more of the 

10 elements may include, for example, written characters and symbols that may appear in the 
date field of a payment instrument. Altematively, the written information may be a 
mathematical representation, as described herein, and the method may include 
recognizing one or more of the elements from the mathematical representation. An order 
of one or more of the elements in the information field of the document may then be 

15 assessed, as shown by step 570. The method may fiirther include, as shown by step 572, 
providing access to a computer system that includes a writing profile. The handwriting 
profile may include one or more writing profile representations of writing from one or 
more vahd documents. In an embodiment, the information field may be a date field and 
the writing profile representations may include variants of a syntax pattem, such as a 

20 written date. The method may fiirther include comparing the elements and the order of 
the one or more elements to one or more writing profile representations to assess whether 
the elements and the order of the one or more elements approximately match the writing 
profile, as shown by step 574. If the elements and the order of the elements do not 
approximately match the syntax pattem profile, the computer may generate a flag 

25 indicating that the payment instrument is potentially fraudulent. 

FIG. 30 illustrates fraud assessment in a date field of a payment instrument. 
Samples 388 include several examples of a date field from payment instruments for an 
account. The date field for the account almost consistently appears as an alphabetic 
30 month, followed by a numeric day, and a two-digit year. Date 389 is the only sample that 
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is inconsistent. Sample 390 is a date field fi"om a payment instrument to be tested for 
fi-aud. The syntax pattem of sample 390 approximately matches only date 389 of samples 
388. In addition, the slant of sample 390 does not match date 389. Therefore, sample 
390 may be potentially firaudulent. 

5 

In one embodiment, the lexicon profile, referred to in Table I, may include one or 
more lexicon names for an information field of a document, such as a payment 
instrument. A lexicon name refers to a specific word or set of characters or symbols that 
has been previously recognized in documents associated with authorized writers, such as 

10 one or more account owners of a payment instrument account. In one embodiment, the 
list of lexicon names may be stored in memory in a computer processable format such as 
a ASCn format. For example, lexicon names for a payee field may include payee names 
that have previously appeared on checks of a checking account. Another example may 
include lexicon names for the city field of a giro account. In one embodiment, the 

1 5 lexicon profile may include a fi:'equency associated with a lexicon name. The fi^equency 
may be a measure of the how often a lexicon name appears on a payment instrument of 
the accoimt. The fi-equency may be expressed as a percentage of payment instruments 
associated with one or more of the lexicon names over a particular time period. For 
example, a lexicon name for a payee field may have appeared on 21% of checks written 

20 over a six month period. In some embodiments, a frequency may be associated with a 
subset of the lexicon names for an information field. For instance, a frequency may be 
associated with the top ten payee names that appear on payment instruments of the 
account. 



25 In certain embodiments, the lexicon profile may be used to assess fraud in a 

document, such as a payment instrument that is presented to a bank for payment. A 
method for assessing fraud is depicted in FIG. 31. In an embodiment, a method of 
assessing fraud in a document using a computer system may include obtaining writing in 
an information field of the document, as shown in step 576. The method may further 

30 include recognizing an entry from the written information in the information field of the 
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document, as indicated in step 578. The entry may include one or more characters or 
symbols. The information field may be, for example, a payee field of a check and the 
entry may be a payee name. Alternatively, the field may be the city field of a giro and the 
entry may be a city name. The method may further include, as shown by step 580, 
5 providing access to a computer system that includes a writing profile. The writing profile 
may include one or more lexicon names for one or more information fields firom one or 
more valid documents. In an embodiment, the method may also include comparing the 
entry to one or more of the lexicon names for the information field in the writing profile 
to assess whether the entry matches the writing profile, as shown in step 582. If the entry 
10 does not approximately match the writing profile, the computer may generate a flag 
indicating that the document is potentially fi'audulent. 

In some embodiments, the method may include determining a firequency 
associated with the entry if the entry approximately matches at least one of at least one of 

15 the lexicon names. If the fi-equency is below a certain level, the computer may generate a 
flag indicating that the payment instrument is potentially fi-audulent. In another 
embodiment, the method may include assessing whether the entry is a member of a subset 
of lexicon names that are associated with a particular fi-equency. If the entry is not a 
member of the subset, the computer may generate a flag indicating that the document is 

20 potentially fi-audulent. 

FIG. 32 illustrates fraud assessment in a city field of a giro. Samples 428 
represent variants of the city name "Bunschoten" for a city field of a giro account. An 
information field content shape or information field content image profile may include 
25 samples 428. Sample 430 is an entry in a city field of a giro. Sample 430 does not appear 
to approximately match samples 428 and was recognized as "Bilthoven." Lexicon 432 
represents a lexicon profile that includes a list of city names that have previously 
appeared on giros of the account. Each city name includes a number in parenthesis 
indicating the number of giros on which the city name has appeared. Sample 430 also 
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appears to fail to approximately match the lexicon profile. Therefore, the giro may be 
potentially fraudulent. 

FIG. 33 is an illustration of fraud assessment in the memo field of a giro of a giro 
5 account. List 464 represents a hst of entries that appeared in the memo field of valid 
giros of a giro account. List 464 may correspond to a lexicon profile. Sample 466 is an 
entry in the memo field of a giro to be validated. A comparison of sample 466 with the 
hst indicates the sample is not consistent with the account. Therefore, the giro is 
potentially fraudulent. 

10 

In some embodiments, the information field cross correlation profile, referred to 
in Table 1, may include cross-field relationships for a docxmient, such as a payment 
instrument account on a computer system. In particular, a writing profile may include 
one or more first lexicon names associated with a first information field of a payment 

15 instrument of the account on a computer system. At least one of one or more of the first 
lexicon names may be associated with one or more second lexicon names associated with 
a second information field. The first lexicon name may include an entry type of the 
information field and the second lexicon name may include an entry type of the second 
information field. The cross-field relationships in the writing profile may include 

20 relationships between information fields that occur with a particular frequency in a 

document, for example, in payment instruments of an account. In this manner, an entry in 
one information field may be used to predict a likely entry type in another information 
field. In one embodiment, a frequency of a particular cross-field correlation in an account 
may be included in the profile. 

25 

Several types of relationships between information fields may occur frequently in 
payment instrument accounts. For example, a particular account number entered in a 
memo field may be correlated with a payee name in the payee field of a check. Also, a 
payee name may be correlated with a particular courtesy amount. In addition, the identity 
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of one account owner of a joint account, obtained from the signature field, may be 
correlated with a syntax pattem in the date field. 

In certain embodiments, the information field cross correlation profile may be 
5 used to assess fraud in a document, such as a payment instrument that is submitted to a 
bank for payment. A method for vahdating a payment instrument is depicted in FIG. 34. 
The method may include assessing whether a first entry in a first information field 
approximately matches one or more first lexicon names in a handwriting profile for the 
first information field, as shown by step 584. The method may fiirther include, as shown 

10 by step 586, obtaining writing in a second information field of the document. Access 

may then be provided to the computer system that includes a writing profile, as shown by 
step 588. The handwriting profile may include cross-field correlations from one or more 
vaUd documents. The method may fiirther include comparing the second entry to a 
second lexicon name of one or more second lexicon names associated with the 

15 approximately matching first lexicon name in the first information field. The comparison 
may be used to assess whether the second entry approximately matches a second lexicon 
name, as shown by step 590. If the second entry does not match a second lexicon, the 
computer may generate a flag indicating that the document may be potentially fraudulent. 
In another embodiment, the frequency that the first lexicon name approximately matches 

20 the second lexicon name may be considered in fraud assessment of the docvunent. 

An embodiment of a method of assessing information in at least one information 
field in a document is depicted by a flow chart in FIG. 35. In an embodiment, one or 
more of the documents may be a payment instrument. The information fields may 

25 include writing of one or more account owners of a payment instrument account. The 
method may include providing a document to the computer system, as shown in step 598 
of FIG. 35. In one embodiment, the documents may include at least one information 
field. In other embodiments, the document may include at least two information fields. 
The document may be provided by accessing at least one image from a database in 

30 memory on a computer system. In certain embodiments, the method may include 
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obtaining information on writing in an information field of the document, as shown in 
step 600. In some embodiments, obtaining information on writing may include 
recognizing written information in an information field. For example, a payee name in a 
payee field of a payment instrument may be recognized. 

5 

As shown in step 602, the method may further include comparing the obtained 
written information in the information field and written information in at least one other 
information field to at least one writing profile representation fi"om at least one other 
document. Altematively, the obtained written information in the information field and 
10 written information in at least two other information fields may be compared to at least 
one handwriting profile representation firom at least one other document. In other 
embodiments, the method may include comparing the obtained written information in the 
information field and written information in at least one other information field to at least 
two writing profile representations fi-om at least one other document. 

15 

In an embodiment, at least one of the other documents may be valid. In certain 
embodiments, at least one of the writing profile representations may include written 
information fi-om the information field and written information from at least one of the 
other information fields. The written information fi:om the information fields may be 
20 fi-om at least one of the other documents. For example, at least one writing profile 

representation may include a cross-field correlation between the information field and at 
least the one other information field. In one embodiment, the cross-field correlation may 
include a lexicon name for the information field and at least one lexicon name for at least 
the one other information field. 

25 

In some embodiments, at least one of the comparisons of written information may 
be used to verify the obtained written information, as shown in step 604. In one 
embodiment, written information may be verified by assessing whether the obtained 
written information in the information field and the written information in. at least one 
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other information field approximately matches at least one writing profile representation 
firom at least one other document. 

In some embodiments, the document stock or pre-printed profile, referred to in 
5 Table 1, may include one or more stock characteristics or pre-printed information of a 
document associated with authorized writers, such as one or more account owners. For 
example, the stock characteristics may characterize a layout of the payment instrument 
corresponding to the account. Stock characteristics may include one or more graphics 
elements, as well as their location, on the payment instrument of the account. Graphics 

10 elements may include, for example, bank logos. The size of the payment instrument of 
the accoxmt may also be a stock characteristic, hi addition, stock characteristics may 
include one or more machine-printed text blocks, along with their locations. Text blocks 
may include, for example, an address of one or more accoimt owners and account 
numbers. The document stock profile may include a machine-printed profile analogous 

15 to a variable writing profile for information fields. The machine-printed profile may 
include mathematical representations and/or images of machine-printed text in the text 
blocks of documents. 

FIG. 36 depicts stock characteristics of a check 340. Label 338 indicates graphics 
20 elements that consist of logos. Label 342 indicates an account/routing order number. In 
addition, label 344 is the name and address of the account owner. 

In certain embodiments, a pre-printed profile may be used to assess a docimient, 
such as a payment instrument that is presented to a bank for payment. A method depicted 

25 in FIG. 37 of assessing a document may include providing a document to the computer 
system, as shown at step 530. The document may include at least one information field. 
In another embodiment, the document may include at least two information fields. The 
method may fiirther include comparing pre-printed information in at least two of the 
information fields of the document to at least one pre-printed profile representation fi-om 

30 at least information field of at least one other document, as shown in at step 532. In an 
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embodiment, at least the one other document may be a valid document. In another 
embodiment, the method may include comparing pre-printed information in at least one 
of the information fields of the document to at least one pre-printed profile representation 
from at least two information fields of at least one other document. Alternatively, pre- 
5 printed information in at least one of the text blocks of the document may be compared to 
at least two pre-printed profile representations from at least one text block of at least one 
other document. 

Pre-printed information may include, but is not limited to a mathematical 
10 representation and/or an image. Pre-printed information may also include, but is not 

limited to at least one type of pre-printed information such as a word type, character type 
and/or graphic element. Pre-printed information may fiirther include, but is not limited to 
a global feature of pre-printed information and a local feature of pre-printed information. 

15 As depicted in step 534, fraud in the document may be assessed using at least one 

of the comparisons. In some embodiments, potential fraud may be indicated by a failure 
of at least a portion of the pre-printed information in at least one of the information fields 
of the docimient to approximately match at least one pre-printed profile representation. 
Altematively, potential fraud may be indicated by a failure of at least a portion of the pre- 

20 printed information in at least two of the information fields of the document to 
approximately match at least one pre-printed profile representation. 

FIG. 38 illustrates fraud assessment in a giro. Text 424 represents machine- 
printed text that is a stock characteristic from a giro of an account. Text 425 and 426 
25 represent the corresponding machine-printed text from giros presented for payment. The 
size of text 424 is 260 pixels x 80 lines. Text 425 and 426 have a size of 350 pixels x 95 
lines. The inconsistency may indicate potential fraud. 

In certain embodiments, fraud may be assessed from variances in writing within 
30 an information field of a document, such as a payment instrument. Variances may occur 
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when a forger alters a specific portion of a document, such as a payment instrument. For 
example, a forger may alter the amount in the courtesy amount field by writing in one or 
more additional numbers. Therefore, an information field may contain written characters 
of the same type written both by a forger and by designated individuals, such as one or 
5 more of the account owners. 

FIG. 39 depicts an embodiment of a method of assessing a document, such as a 
payment instrument using a computer system. The method may include providing a 
document to a computer system, as shown in step 592. In an embodiment, the document 

10 may include at least one information field. In some embodiments, writing in at least one 
of the information fields of the document may include at least two examples of a type of 
written information. Writing may include, but is not limited to types of characters, 
words, symbols, and/or other writing features. Other writing features may include, but 
are not limited to local slant, global slant, or pen thickness. For example, written 

15 characters may include one or more character types, as described herein. An information 
field may include written characters of the same character type. For instance, a courtesy 
amoxmt field of a payment instrument may read '3,740.53.' This courtesy amount field 
includes two examples of a type of written information, the first '3' and the second '3.' 
In some embodiments, the writing may include an image. Altematively, the writing may 

20 include a mathematical representation. In an embodiment, examples of types of written 
information may be recognized fi-om an image and/or a mathematical representation. 

The method may further include, as shown by step 594, comparing at least two of 
the examples of the type of written information. At step 596, the method may 

25 additionally include assessing whether two or more of the examples approximately 
match. In the case cited above, the first and the second '3' may be compared to assess 
whether they match. One embodiment may include comparing images of examples of a 
type of written information with image comparison software. Altematively, examples of 
a type of written information may be converted to mathematical representations. In this 

30 case, the handwriting features of the mathematical representations may be compared. If 
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at least two of the examples of written information do not approximately match, the 
computer may generate a flag indicating that the document may be potentially fraudulent. 

In some embodiments, the method may further include comparing the examples of 
5 types of written information to at least one writing profile representation. For example, 
examples of types of handwritten characters may be compared to writing profile 
representations in the digit/alpha profile. 

Some embodiments may include a method of assessing fraud from variances in 
10 writing between different information fields of a document. Li one embodiment, a 

document may include at least two information fields. At least two information fields of 
the document include at least one example of a type of written information. The method 
may include comparing at least two of the examples in at least two of the information 
fields. The method may further include assessing whether two or more of the examples 
15 approximately match. For instance, numerals in a date field and numerals in a courtesy 
amount field of a payment instrument may be compared. For example, a courtesy amount 
field may read '3,340.53.' A date field of the same payment instrument may read 
' 1/4/03.' Both the courtesy field and the date field include examples of a written '4,' 
which may be compared. If at least two of the examples of the type of written 
20 information do not approximately match, the computer may generate a flag indicating that 
the document may be potentially fraudulent. 



FIG. 40 is an illustration of assessing fraud from variations in handwriting in the 
same information field and between different information fields of a giro. FIG. 40a 
25 depicts amount field 372 and credit account field 374 from a giro. Two apparent 

differences may be assessed between the two information fields. First, there is a 
difference in ink thickness between information fields. Second, there is a difference in 
the style of the '6' between the two information fields. 
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FIG. 40b depicts amount field 376 and credit account field 378 firom a giro. There 
is a difference in style between the two '5's in the account field. In addition, there is a 
difference in style of the '2' is the amount field and the '2' in the account field. 



5 FIG. 40c depicts amount field 380 and credit account field 382 fi-om a giro. 

Variations exist in ink thickness for digits in the account field. In addition, there is a 
variation in slant for digits in the account field. Also, a difference in style is exhibited for 
the '3' and '5' between the amount and accoimt fields. 

10 FIG. 40d depicts amount field 384 and credit account field 386 fi^om a giro. There 

are differences in style for the '7', '2\ and '4' between the amoxmt and account field. In 
addition, there is a variation in slant between digits in the account field. 

In one embodiment, a handwriting profile, such as a payment instrument accoimt 
15 profile, may include a database of previously identified forgers. The database may further 
include a forger writing profile for one or more identified forgers. The forger writing 
profile is analogous to the writing profile for authorized writers, such as one or more 
account owners, shown in Table 1 . A forger profile may include at least some writing 
profile information obtained from previously identified forged documents, such as 
20 payment instruments, associated with a forger. 

FIG. 41 depicts an embodiment of a method for identifying a document 
comprising forged information using a computer system. The method may include 
providing a document to the computer system, as shown by step 612. In one 

25 embodiment, the document may include at least one information field. In another 

embodiment, the document may include at least two information fields. As shown at step 
614, the method may further include comparing writing in at least two of the information 
fields of the document to at least one forger writing profile representation from at least 
one information field. In another embodiment, the method may include comparing 

30 writing in at least one of the information fields of the document to at least one forger 
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writing profile representation firom at least two information fields of at least one forged 
document. In some embodiments, writing in at least one of the information fields of the 
document maybe compared to at least one forger writing profile representation fi-om at 
least two information fields of at least one forged document 

5 

Additionally, as shown at step 615, the method may include identifying the 
document as a document comprising forged information firom an approximate match of at 
least one forger writing profile representation with writing in the document. The method 
may fiirther include identifying the forger of the document from the forger writing profile 
10 if the docxmient is identified as forged, as indicated by step 616. 

For many financial services companies keying labor represents a large data 
capture cost. For example, one of the most expensive keying operations is the keying of 
data fi-om a written information field, such as a payee name. Keying labor may be 
15 reduced through application of a writing profile, such as a payment instrument account 
profile. A profile, as described in Table 1 , may be used to capture written information in 
information fields of dociunents, such as payment instruments presented for payment. 



FIG. 42 depicts an embodiment of a method of capturing written information fi-om 
20 an information field of a document using a computer system. The method may include 
providing a document to the computer system, as shown by step 618. In one 
embodiment, the dociunent may include at least one information field. In one 
embodiment, the document may include at least two information fields. The method may 
further include assessing whether writing in an information field approximately matches a 
25 writing profile representation fi^om at least one information field fi-om at least one other 
document, as indicated by step 620. In an embodiment, the matching writing profile 
representation is associated with a corresponding text representation. 
In some embodiments, the text representations may be stored in memory in a computer 
processable format on the computer system. For example, the computer processable 
30 format may be ASCII format. As shown by step 622, the information field may then be 
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associated with the text representation corresponding to the matching writing profile 
representation. The writing in the information field may be a mathematical 
representation, as described herein. The matching text representation may be assessed 
fi-om features included in writing profile representations that include mathematical 
5 representations fi-om the writing profile. 

FIG. 43 illustrates the capture of an entry in an information field of a payment 
instrument. Samples 368 represent entries in payee fields extracted fi-om valid checks. 
The entries may be included in a checking account profile and may be stored as 
10 mathematical representations. Sample 370 is an entry in a payee field of a check is to be 
captured. Sample 370 may be identified as "NORTHGATE fflGH SCHOOL" from the 
handwriting features in the mathematical representations in samples 368. 

Further modifications and altemative embodiments of various aspects of the 
invention may be apparent to those skilled in the art in view of this description. 
Accordingly, this description is to be constmed as illustrative only and is for the purpose 
of teaching those skilled in the art the general manner of carrying out the invention. It is 
to be understood that the forms of the invention shown and described herein are to be 
taken as the presently preferred embodiments. Elements and materials may be substituted 
for those illustrated and described herein, parts and processes maybe reversed, and 
certain features of the invention may be utiHzed independently, all as would be apparent 
to one skilled in the art after having the benefit of this description of the invention. 
Changes may be made in the elements described herein without departing fi:om the spirit 
and scope of the invention as described in the following claims. 



15 
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